Bioinformatics pipeline to identify candidate regulatory variants in late‐onset Alzheimer’s disease (LOAD) associated regions

نویسندگان

چکیده

Background Small insertions and deletions (indels) in the human genome are substantial contributors to genetic variation impact traits diseases. The role of indels late onset Alzheimer’s disease has been understudied. Few examples, such as intronic poly-T variant TOMM40 gene, suggest that systematic exploration within LOAD risk regions will advance understanding architecture LOAD. Previously we developed a bioinformatics pipeline characterizes prioritizes candidate regulatory SNPs enhancers located LOAD-GWAS regions. Here extend analysis indels. proposed progresses from filtered set variants have predicted strong effect on transcription factor (TFs) binding. Method utilized publicly available functional genomics data sources. Primarily, cis-regulatory elements (cCREs) ENCODE single-cell RNA-seq patient samples (synapse: syn22079621). For TF binding employed motifs MotifDb. In addition, used various software including motifbreakR. Result We catalogued 1230 proximal CTCF-bound regions, 912 showed epigenetic evidence relevant brain tissue. 426 these cCREs. These disrupted 391 TFs, 362 had snRNA-seq samples. Of note, APOE-TOMM40, SPI1 MS4A2 were significantly by Amongst TFs RUNX3, SMAD3. Interestingly, significant findings with consistent our prior results for SNPs. Conclusion This study provides an analytical framework catalogue noncoding indel loci characterize their likelihood perturb approach integrates multiple types prioritize genes validation experiments using models gene editing technologies.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

From Genome to Candidate Cis-regulatory Networks: A Bioinformatics Approach

Motivation: Present technological enhancements have resulted in public databases containing data sets of various types: gene expression, protein-DNA interaction and transcription factor (TF) activity data, protein-protein interactions, and genomic sequence and ontology information. The analysis of these large volumes of information holds the promise of identification of the nonlinear dynamic fu...

متن کامل

From Genome to Candidate Cis - Regulatory Networks : A Bioinformatics Approach

Present technological enhancements have resulted in public databases containing data sets of various types: gene expression, protein-DNA interaction and transcription factor (TF) activity data, protein-protein interactions, and genomic sequence and ontology information. The analysis of these large volumes of information holds the promise of uncovering the complex dynamic function of the biochem...

متن کامل

Can modular analysis identify disease-associated candidate genes for therapeutics?

Complex diseases such as allergy change gene expression in several cell types and tissues. Benson and colleagues have now shown, in a paper in BMC Systems Biology, that this complexity can be studied effectively using an integrated experimental and computational modular analysis. Their strategy revealed a core of allergy-associated genes of potential therapeutic value.

متن کامل

Breakthrough Technologies A Motif and Amino Acid Bias Bioinformatics Pipeline to Identify Hydroxyproline-Rich Glycoproteins

Intrinsically disordered proteins (IDPs) are functional proteins that lack a well-defined three-dimensional structure. The study of IDPs is a rapidly growing area as the crucial biological functions of more of these proteins are uncovered. In plants, IDPs are implicated in plant stress responses, signaling, and regulatory processes. A superfamily of cell wall proteins, the hydroxyprolinerich gl...

متن کامل

identify regulatory motifs.. Bioinformatics. Vol 19:18 (2369-2380)

Identification of regulatory motifs in DNA sequences is made difficult primarily by their degeneracy. Computational techniques to find statistically over-represented sequence profiles are aided by inputting sequences in which motifs are fairly certain to be found. Prudent selection of orthologous genes ensures that species are sufficiently diverged to have low sequence similarities in regions n...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Alzheimers & Dementia

سال: 2023

ISSN: ['1552-5260', '1552-5279']

DOI: https://doi.org/10.1002/alz.061063